Speech Compression using Analysis by Synthesis

نویسندگان

  • Minal Mulye
  • Sonal K. Jagtap
چکیده

Linear prediction plays a fundamental role in all aspects of speech. Its use seems natural and obvious since for a speech signal the value of its current sample can be well modeled as a linear combination of its past values. Calculation for predictor coefficients with the help of automatic code generation gives the solution for early and efficient computing. Automatic code generation is a fast paced technology with lots of new capabilities and customer experiences. In this, Simulink is the leading environment for modeling and simulating a broad range of dynamic systems. It is the foundation for Model-Based Design, with support for multi-domain modeling and simulation, automatic code generation for production use and real-time testing, and a range of capabilities for verification and validation. With its open architecture and APIs, the integration of models from other tools or languages such as C\C++ is possible. Also embedded coder with Simulink can generate C code for target specific applications. The proposed system implements a model based design by using the linear prediction coefficients of the encoded speech data and prove to be the promising method for speech compression.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Effect of MPEG audio compression on HMM-based speech synthesis

In this paper, the effect of MPEG audio compression on HMMbased speech synthesis is studied. Speech signals are encoded with various compression rates and analyzed using the GlottHMM vocoder. Objective evaluation results show that the vocoder parameters start to degrade from encoding with bitrates of 32 kbit/s or less, which is also confirmed by the subjective evaluation of the vocoder analysis...

متن کامل

Speech compression a novel method pdf

Text summarization is a process that reduces the size of the text document. Purpose, we use part of speech tagging to recognize types of the text words. speech compression applications Compression rate is a scale to decrease the size of text summary. speech compression abstract A higher.This paper illustrates a novel method of speech compression and transmission. This method saves the transmiss...

متن کامل

Multi-band frequency compression for improving speech perception by listeners with moderate sensorineural hearing loss

In multi-band frequency compression, the speech spectrum is divided into a number of analysis bands, and the spectral samples in each band are compressed towards the band center by a constant compression factor, resulting in presentation of the speech energy in relatively narrow bands, for reducing the effect of increased intraspeech spectral masking associated with sensorineural hearing loss. ...

متن کامل

Biosignal Processing Applications for Speech Processing

Speech is a biosignal that is amenable to general biosignal processing methodologies such as frequency domain processing. This is supported today by the availability of inexpensive digital multimedia hardware and by the developments of the theoretical aspects of signal processing. However, sound processing must be also regarded through the prism of the psychoacoustic reality of the human hearin...

متن کامل

Low Resource TTS Synthesis Based on Cepstral Filter with Phase Randomized Excitation

In this paper we present the acoustic synthesis of a low resource Text-To-Speech (TTS) system based on a 7th order cepstral filter. The excitation signal is designed in frequency domain by a two parameter model. This model is able to generate the excitation signal for both, voiced and unvoiced segments. The sets of filter coefficients represent the speech units and are stored in a compressed fo...

متن کامل

Intelligibility analysis of fast synthesized speech

In this paper we analyse the effect of speech corpus and compression method on the intelligibility of synthesized speech at fast rates. We recorded English and German language voice talents at a normal and a fast speaking rate and trained an HSMMbased synthesis system based on the normal and the fast data of each speaker. We compared three compression methods: scaling the variance of the state ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014